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Abstract 

Conventional studies of network growth models mainly look at the steady state degree distribution of the graph. 
Often long time behavior is considered, hence the initial condition is ignored. In this contribution, the time evolution 
of the degree distribution is the center of attention. We consider two specific growth models; incoming nodes with 
uniform and preferential attachment, and the degree distribution of the graph for arbitrary initial condition is obtained 
as a function of time. This allows us to characterize the transient behavior of the degree distribution, as well as to 
quantify the rate of convergence to the steady-state limit. 

I. Introduction 

The network paradigm is ubiquitous, including many fields such as epidemiology (T), sociology Q, (3), Q], 
economics 0, |6), management [7| and marketing |8], [9), ifTOll . to name a few. The dynamics of the phenomena 
under consideration usually depends on the topological properties of the network, drawing attention to its formation 
process. Network formation models for static population has been studied mostly under the category of random 
graphs with Poisson distribution ifTD . 

There are ample applications for which the network evolves through time. In fact most real networks do, trivial 
examples being the society, the world-wide web, scientific citation networks, etc. Examples of conventional models 
for growing graphs are Small-World networks [13] and the so-called preferential attachment models fl4l . Ifl6l . 
usually associated with scale-free properties. In those models the long-time behavior of the graph (the thermodynamic 
limit) is of interest and the asymptotic distribution is investigated for which the initial conditions can be neglected 

ED, ED, ED. 

On the contrary, one might be interested in the short-time regime of the growth process. One encounters this in 
population growth (where incoming nodes are born or are new immigrants that join the pre-existing network of a 
country) or any process explicable in terms of the growth mechanism of the structure, and wherever extrapolation 
of the time evolution of the graph might be of interest. On the other hand, although the eventual behavior is known, 
one might ask how fast does the network reach the steady-state, to make sense of the equilibrium assumptions 
accordingly. 

In this paper we derive closed form expressions for the expected degree distribution over time as a network 
evolves. We focus on two particular growth models. Both models begin with an initial network (the substrate), 
and nodes are added to the network as time advances. In the first model, termed uniform selection, new nodes 
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connect to one or more existing nodes without preference (i.e., drawn uniformly from the pool of existing nodes). 
In the second model, new nodes connect to one existing node by following the preferential attachment model lfl5l . 
whereby the neighbor of the new node is drawn with probability proportional to its present number of neighbors. 
For both models we derive closed form expressions for the expected degree distribution as a function of time. 
These allow for characterization of the rate of convergence to the limiting distributions. In both models, the limiting 
distributions do not depend on characteristics of the initial substrate. 

The rest of this extended abstract is organized as follows. Section [II] presents results for the uniform attachment 



model, and Section III presents results for the preferential attachment model. Both of these sections are comple- 



mented with numerical simulations verifying the theoretical expressions we derive. Section IV offers concluding 
remarks. 

II. Uniform Linking 

In this section we assume the new nodes pick single or multiple existing nodes and connect to them. They choose 
the destination nodes uniformly at random, i.e., regardless of their degree. It is assumed that the existing nodes at 
each timestep do not delete or add links. 

A. Single Connection 

The model begins with an initial network, which we refer to as the substrate, at time t — 0. We assume that 
the substrate contains N(0) nodes, and the number of nodes in the substrate having degree k > 1 is given by 
ATfc(O) = Ufe. We also assume that the substrate is connected. If this is not the case, then under the uniform single 
connection model, the limiting graph will have the same number of components as the substrate, and new nodes 
join one component with a probability proportional to the current number of nodes in that component. 

Suppose new nodes are born at a rate a and connect to an existing node which is drawn uniformly at random 
from the existing population, so at each time step a new nodes with degree 1 are added to the network. Since there 
are N(0) nodes at the outset, the total number of nodes in the graph after time t is iV(0) + at. Let N/.(t) denote 
the expected number of nodes having degree k at time t. We seek the time evolution of the quantities {Nk(t)} 
in order to find an expression for the degree distribution by dividing by the total population at time t afterwards. 
To do so, it is expedient to use the rates of transition between states to study the dynamics ll20l . According to the 
uniform attachment dynamics, the expected degree distribution evolves as follows for t > 0, 

N k {t + At) = N k (t) + - m ^ i (N k - 1 -N k ) fc^l 
N l (t + At) = N 1 (t)- w ^ i + aAt k = l. 

These dynamics can be compactly represented by the following differential equation, 

^ = NWTa-t^- N ^ + aS ^ (2) 
where Nk is the first derivative of Nk(t) with respect to time, 6 k ,i is the Kronecker delta function (i.e., 5k,i = 1 if 
k = 1, and 6k i = 1 otherwise), and dependence on time is suppressed to simplify the presentation. Since the initial 
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network is connected by assumption and every new node connects to one existing node, we have that N (t) = 
for all t > 0. 

In order to solve pi, we will use the generating function ip(z,t) = J^k z ~ Nk(t). We get 



-Xz- 1 -1)^,1) 



dt N{0) + at 
which is solved by multiplying both sides by the integration factor (t 



AT(0Ki_i 



The result is 



iN N{0) + at _ . N{0) i_ 1 



z(2 



and incorporating the initial condition gives 

^ V ' y *(2- i) iV(0) + crf 



of 



^o)(i + i ^ y )* 



7V(0) 2 1 
iV(0) +atz(2- i; 



1 



erf 

iv(o); 



V 



(3) 



(4) 



(5) 



Recalling that in the initial (substrate), Nk(0) = nk nodes have degree k, we find that the solution of Nk(i) is 
given by, 



1 7V(0) 2 



In 1 



{} + N(0)) 



iV(0) ; 



N(0) + at 




In 1 



at 
N(0) 



fc! 



(6) 



where * denotes the convolution operator. Dividing by the total number of nodes at time t gives the expected degree 
distribution, 



1 

Pk(t) = ^ 



N(Q) 



N(0) + at 



In 1 



( x + JV(0)) 




(7) 



Note that as t —> oo, the term involving initial conditions vanishes, and the asymptotic behavior of the degree 
distribution is as follows 

limp fe (t) = i. (8) 



This matches the asymptotic behavior previously found in lfT2l . lfT9l . 

As an illustrative example, let us take the substrate to be a start with iV(0) nodes; i.e., there is one central node 
with degree iV(0) — 1, and all other nodes have a single edge connecting to the central node. We simulate this 
scenario with iV(0) = 10. Figure [T] shows the fraction of nodes with degrees k = 2 - 6 and 9. The solid lines 
show our theoretical expression, and the scatter points for each curve are the average over 10000 Monte Carlo 
simulations. We use a = 1. Figure [2] shows similar results but for using a double-star for the initial substrate. 
The double star has two central nodes which each connect to each other, and each central node has a number of 
satellites. In our simulation, iV(0) = 10 and the central nodes have degrees 4 and 6. As predicted, the limiting 
degree distribution is independent of the distribution of the initial substrate. 
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Fig. 1. Pk(t) for different values of k as a function of time for the star topology in uniform linking scheme. Solid lines represent the theoretical 
prediction. 
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Fig. 2. Pfc(t) for different values of k as a function of time for the double-star topology in uniform linking scheme. Solid lines represent the 
theoretical prediction. 

B. Multiple Connections 

Now, suppose that each newly-born node connects to (3 existing nodes drawn uniformly at random from the 
existing population. Note that this model potentially yields very different results. In the former case (single 
connection), if the substrate begins as a disconnected graph then, with probability 1, the limiting graph contains 
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the same number of connected components as the substrate. On the other hand, in the uniform multiple destination 
model there is non-zero probability that each new node will form a bridge between two components. 
Under the uniform multiple connection model, the differential equation for Nk(t) now becomes 

13a 



The generating function is of the form 

^"-^■"(s^s)" 1 " 1 ' 

and so the populations are given by 



N{0) + at 



(N, 



fe-i 



N, 



aS 



N(0) 



^[1 + /?(!- A)] \N{0) + at 



N(0) 



,8(1- 



N(0) + at 



(9) 



(10) 



N k (t) 



N(0) + at 



(3 + 1 



k-p+i 



u(k-/3) + 

N(0)P +1 
/3(N(0) + atf 



N(0)P +1 
(N(0) + atf 



/31n(l + 



at 
N(Q) 



fc! 




k-p+i 



u(k- (3) 



/31n (l 



at 
N(0) 



fc! 



(11) 



were u(x) is the Heaviside step function (i.e., u(x) = for x < 0, and u(x) = 1 for x > 0). Normalizing by the 
total population at time t, we arrive at the degree distribution: 

fc "\ 



Pk{t) = 







1 



(3 \/3 + l 



k-0+1 



u{k- f3) 




fc! 



k-p+i 



u(k-j3) 



/31n 



at 
N(0) 



fc! 



Asymptotically, the second and third terms vanish, and we have 

k-p+i 



Pk(t) 



1 



u{k-p). 



(12) 



(13) 



(3 V/3 + 1 

Of course, since every new node connects f3 existing nodes, as t — > oo, all nodes have degree at least (3. Likewise, 
taking (3 — 1, we recover the same asymptotic distribution as in the previous section. 

Simulation results are depicted Figures [5] and [4] for the same two substrates as used above, and with j3 = 4. 

III. Preferential Linking 

It might be the case that nodes are not impartial to the status of the nodes they want to select for attachment. 
There are situations exhibiting the Mathew effect, where popularity is self-reinforcing (rich-gets-richer). Scientific 
citation networks are examples of that [21 1, l22ll . |23 1. Suppose each new node connects to an existing node drawn 
with a probability proportional to a function of its current degree. Any increasing function of degree can be used to 
model preferential attachment; here we focus on the case where probabilities are linearly proportional to degree lfT31 . 
The evolution of Nk(t) is given by the differential equation 

a 



((k — l)iVfe_i - kN k ) + a8k, 



(14) 
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Fig. 3. Pk(t) for different values of fc as a function of time for the star topology in uniform linking scheme where each new node connects 
to /? = 4 existing nodes. Solid lines represent the theoretical prediction. 

0.35 r 




Time 

Fig. 4. Pk(t) for different values of fc as a function of time for the double-star topology in uniform linking scheme where each new node 
connects to /? = 4 existing nodes. Solid lines represent the theoretical prediction. 

Recall that Nk(t) = (N(0) + at)p k (t). We use the generating function for Pk(t) defined as 

rP(z,t) = ^> fc (^- fe . (15) 

k 
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Multiplying ( fl4| ) by z k and summing over k, we get 



V W ; (7V(0)fco + 2at) <9z 



aip, 



(16) 



where fco is the average degree of the substrate. Equation ( [To} is a linear partial differential equation which has 
characteristic curve (z — l) 2 (iV(0)fco + 2at) — C. Using this, the solution has the form 

a$(C) \ 



N(0) + at N(0) + at 



2JC(N(0)k + 2at) + (N(0)k o + 2at) + 2Cln(VC 



where $(C) is uniquely determined by the initial conditions N(0) and fco. 
Define the constant 

c= 1 



' ^V(0)fco 
N(0)k + 2at' 



(N(0)k o + 2at)) 
(17) 



(18) 



The generating function simplifies to 

iv((Mf^o) 

^ t)= N{0) + at 
and the degree distribution is thus 



N(0)k o + 2at 
N{0) + at 



k=l 



„k+2 



k + 2 



(19) 



Pk(t) 



N(0)k + 2at ( c k+2 



N(0) + at \k + 2 
, ( N(0) 



c k+i 

> 

J k + 1 

k 



\N{0)+at / 



E 

m— 1 



k - 1 

TO — 1 



iV(0)fc 



N(0)k o + 2at 



1 - 



N(0)ko 



N{0)k + 2at 



k — r 



(20) 



Note that c — > 1 as t —> oo, and the second term on the right hand side also vanishes as t —> oo. Thus, the 
asymptotic behavior is 



Pfc(t) 



1 2 
F+2 + 1 



which simplifies to 



Pk 



(21) 



(22) 



fc(fc + l)(fc + 2)' 

This asymptotic result was previously found in lfT8l , which is equivalent to the fc -3 power law for the so-called 
scale-free graphs lfl4l . 

To verify this result we take the same initial graphs as before (start and double-star with N(0) — 10). The 
analytical result is compared with the simulation results in Figures [5] and [6] Again, the scatter points indicate 
averages over 10000 Monte Carlo trials with a = 1. 

IV. Summary 

Previous studies of network growth models have primarily focused on the steady-state degree distribution. In 
this paper we find the exact degree distribution of a growing network as a function of time, for arbitrary initial 
conditions. This can be used to determine rates of convergence to the asymptotic limits. Understanding the short 
time behavior of graph evolution may also be useful in situations where the model parameters a and/or (3 vary 
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Fig. 5. Pk{t) for different values of A; as a function of time for the star topology in preferential linking scheme. 
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Fig. 6. pfc (t) for different values of A; as a function of time for the double-star topology in preferential linking scheme. 



slowly with time. In these cases, the existing asymptotic analysis no longer applies, but it may be reasonable to 
treat the parameters as constant over short time scale and finding a piece-wise solution. 

For the growth process, uniform and preferential linking of the newly born nodes are considered and theoretical 
predictions are compared with simulations. The full version of the paper will also contain results for a preferential 
attachment model where new nodes make connections to multiple existing nodes. 
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